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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/125 , 031A 



DATE: 08/15/2000 
TIME: 13:52:15 



ENTERED 



3 
4 
5 
6 
7 



<110> 



ROTH, CHARLES 
BARNWELL, JOHN 
MENDIS, KAMINI 



APPLICANT: LONGACRE- ANDRE , SHIRLEY 



NATO, FARIDABANO 



9 <120> TITLE OF INVENTION: RECOMBINANT PROTEIN CONTAINING A C -.TERMINAL FRAGMENT OF 



12 <130> FILE REFERENCE: 0660 - 0139 -OXPCT 

14 <140> CURRENT APPLICATION NUMBER: 09/125, 031A 

15 <141> CURRENT FILING DATE: 1999-03-10 

17 <150> PRIOR APPLICATION NUMBER: PCT/FR97/00290 

18 <151> PRIOR FILING DATE: 1997-02-14 

20 <150> PRIOR APPLICATION NUMBER: FR96/01822 

21 <151> PRIOR FILING DATE: 1996-02-14 
23 <160> NUMBER OF SEQ ID NOS : 15 

25 <170> SOFTWARE: Patentln Ver. 2.1 

27 <210> SEQ ID NO: 1. 

28 <211> LENGTH: 291 

29 <212> TYPE: DNA 

30 <213> ORGANISM: Artificial Sequence 

32 <220> FEATURE: 

33 <223> OTHER INFORMATION: Description of Artificial Sequence: SYNTHETIC 

35 <220> FEATURE: 

36 <221> NAME/KEY: CDS 

37 <222> LOCATION: (1)..(291) 
39 <400> SEQUENCE: 1 



40 


gaa 


ttc 


aac 


ate 


teg 


cag 


cac 


caa 


tgc 


gtg 


aaa 


aaa 


caa 


tgt 


ccc 


gag 


48 


41 


Glu 


Phe 


Asn 


lie 


Ser 


Gin 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 




42 


1 








5 










10 










15 






44 


aac 


tct 


ggc 


tgt 


ttc 


aga 


cac 


ttg 


gac 


gag 


aga 


gag 


gag 


tgt 


aaa 


tgt 


96 


45 


Asn 


Ser 


Gly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 




46 








20 










25 










30 








48 


ctg 


ctg 


aac 


tac 


aaa 


cag 


gag 


ggc 


gac 


aag 


tgc 


gtg 


gag 


aac 


ccc 


aac 


144 


49 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 




50 






35 










40 










45 










52 


ccg 


acc 


tgt 


aac 


gag 


aac 


aac 


ggc 


ggc 


tgt 


gac 


gca 


gac 


gec 


aaa 


tgc 


192 


53 


Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 




54 




50 










55 










60 












56 


acc 


gag 


gag 


gac 


teg 


ggc 


age 


aac 


ggc 


aag 


aaa 


ate 


acg 


tgt 


gag 


tgt 


240 


57 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


He 


Thr 


Cys 


Glu 


Cys 




58 


65 










70 










75 










80 




60 


acc 


aaa 


ccc 


gac 


teg 


tac 


ccg 


ctg 


ttc 


gac 


ggc 


ate 


ttc 


tgc 


age 


taa 


288 


61 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro 


Leu 


Phe 


Asp Gly 


He 


Phe 


Cys 


Ser 






62 










85 










90 










95 







64 taa 291 

68 <210> SEQ ID NO: 2 

69 <211> LENGTH: 95 



10 



PLASMODIUM MSP-1 
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70 <212> TYPE: PRT 

71 <213> ORGANISM: Artificial Sequence 
W--> 72 <220> FEATURE: 



72 <223> OTHER INFORMATION: Description of Artificial Sequence: SYNTHETIC ?6/) 



74 



<400> SEQUENCE: 


2 
























Glu 


Phe 


Asn 


He 


Ser 


Gin 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


1 








5 










10 










15 




Asn 


Ser 


Gly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 








20 










25 










30 






Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 






35 










40 










45 








Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 




50 










55 










60 










Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


He 


Thr 


Cys 


Glu 


Cys 


65 










70 










75 










80 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro 


Leu 


Phe 


Asp 


Gly 


He 


Phe 


Cys 


Ser 





76 
77 
78 
79 
80 
81 
82 
83 
84 
85 

86 85 90 95 

90 <210> SEQ ID NO: 3 

91 <211> LENGTH: 279 

92 <212> TYPE: DNA 

93 <213> ORGANISM: Plasmodium falciparum 

95 <4 00> SEQUENCE: 3 

96 aacatttcac aacaccaatg cgtaaaaaaa caatgtccag aaaattctgg atgtttcaga 60 

97 catttagatg aaagagaaga atgtaaatgt ttattaaatt acaaacaaga aggtgataaa 120 

98 tgtgttgaaa atccaaatcc tacttgtaac gaaaataatg gtggatgtga tgcagatgcc 180 

99 aaatgtaccg aagaagattc aggtagcaac ggaaagaaaa tcacatgtga atgtactaaa 240 

100 cctgattctt atccactttt cgatggtatt ttctgcagt 279 

103 <210> SEQ ID NO: 4 

104 <211> LENGTH: 354 

105 <212> TYPE: DNA 

106 <213> ORGANISM: Artificial Sequence 

108 <220> FEATURE: 

109 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
111 <220> FEATURE: 



112 <221> NAME/KEY 

113 <222> LOCATION 
115 <400> SEQUENCE 



CDS 

(1) . - (354) 
4 



116 gaa ttc aac ate teg cag cac caa tgc gtg aaa aaa caa tgt ccc gag 4 8 

117 Glu Phe Asn He Ser Gin His Gin Cys Val Lys Lys Gin Cys Pro Glu 

118 15 10 15 

120 aac tct ggc tgt ttc aga cac ttg gac gag aga gag gag tgt aaa tgt 96 

121 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys 

122 20 25 30 

124 ctg ctg aac tac aaa cag gag ggc gac aag tgc gtg gag aac ccc aac 144 

125 Leu Leu Asn Tyr Lys Gin Glu Gly Asp Lys Cys Val Glu Asn Pro Asn 

126 35 40 45 

128 ccg acc tgt aac gag aac aac ggc ggc tgt gac gca gac gec aaa tgc 192 

129 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys 

130 50 55 60 
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132 


acc 


gag 


gag 


gac 


teg ggc 


age aac ggc aag aaa 


ate 


acg 


tgt 


gag 


tgt 


240 


133 


Thr 


Glu 


Glu 


Asp 


Ser Gly 


Ser Asn Gly Lys Lys 


He 


Thr 


Cys 


Glu 


Cys 




134 


65 








70 


75 










80 




136 


acc 


aaa 


ccc 


gac 


teg tac 


ccg ctg ttc gac ggc 


ate 


ttc 


tgc 


age 


tec 


288 


137 


Thr 


Lys 


Pro 


Asp 


Ser Tyr 


Pro Leu Phe Asp Gly 


He 


Phe 


Cys 


Ser 


Ser 




138 










85 


90 








95 






140 


tct 


aac 


ttc 


ttg 


ggc ate 


teg ttc ttg ttg ate 


etc 


atg 


ttg 


ate 


ttg 


336 


141 


Ser 


Asn 


Phe 


Leu 


Gly He 


Ser Phe Leu Leu He 


Leu 


Met 


Leu 


He 


Leu 




142 








100 




105 






110 








144 


tac 


age 


ttc 


att 


taa taa 














354 


145 


Tyr 


Ser 


Phe 


He 


















146 






115 






















149 <210> SEQ ID NO: 5 

150 <211> LENGTH: 116 

151 <212> TYPE: PRT 

152 <213> ORGANISM: Artificial Sequence 
W--> 153 <220> FEATURE: 



153 


<223> OTHER 


INFORMATION; 


: Description of 


Artificial Sequence: 


: SYNTHETIC 


155 


<400> SEQUENCE: 


5 






















156 


Glu 


Phe 


Asn 


He 


Ser 


Gin 


His Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


157 


1 








5 








10 










15 




158 


Asn 


Ser 


Gly 


Cys 


Phe 


Arg 


His Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 


159 








20 








25 










30 






160 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin Glu Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 


161 






35 








40 










45 








162 


Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 


163 




50 










55 








60 










164 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser Asn 


Gly 


Lys 


Lys 


He 


Thr 


Cys 


Glu 


Cys 


165 


65 










70 








75 










80 


166 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro Leu 


Phe 


Asp 


Gly 


He 


Phe 


Cys 


Ser 


Ser 


167 










85 








90 










95 




168 


Ser 


Asn 


Phe 


Leu 


Gly 


He 


Ser Phe 


Leu 


Leu 


He 


Leu 


Met 


Leu 


He 


Leu 


169 








100 








105 










110 






170 


Tyr 


Ser 


Phe 


He 
























171 






115 



























175 <210> SEQ ID NO: 6 

176 <211> LENGTH: 342 

177 <212> TYPE: DNA 

178 <213> ORGANISM: Plasmodium falciparum 

180 <400> SEQUENCE : 6 

181 aacatttcac aacaccaatg cgtaaaaaaa caatgtccag aaaattctgg atgtttcaga 

182 catttagatg aaagagaaga atgtaaatgt ttattaaatt acaaacaaga aggtgataaa 

183 tgtgttgaaa atccaaatcc tacttgtaac gaaaataatg gtggatgtga tgeagatgee 

184 aaatgtaccg aagaagattc aggtagcaac ggaaagaaaa tcacatgtga atgtactaaa 

185 cctgattctt atccactttt cgatggtatt ttctgcagtt cctctaactt cttaggaata 

186 tcattcttat taatactcat gttaatatta tacagtttca tt 

189 <210> SEQ ID NO: 7 

190 <211> LENGTH: 387 

191 <212> TYPE: DNA 



60 

120 

180 

240 

300 

342 



file://C:\CRF3\Outhold\VsrI 1 2503 1 A.htm 



8/15/00 



Page 4 of 7 



RAW SEQUENCE LISTING DATE : 08/15/2000 



PATENT APPLICATION : US/09/12 5 , 03 1A TIME: 13:52:15 ffijj^ 

Input Set : A:\660139.app 

Output Set: N:\CRF3\08152000\I125031A.raw 



192 


<213> ORGANISM: 


Plasmodium falciparum 














194 


<220> FEATURE: 


























195 


<221> NAME/KEY: 


CDS 
























196 


<222> LOCATION: 


(1) 


.(387) 




















198 


<400> SEQUENCE: 


7 
























199 


atg 


aag 


gcg 


eta 


etc 


ttt 


ttg 


ttc 


tct 


ttc 


att 


ttt 


ttc 


gtt 


acc 


aaa 


200 


Met 


Lys 


Ala 


Leu 


Leu 


Phe 


Leu 


Phe 


Ser 


Phe 


He 


Phe 


Phe 


Val 


Thr 


Lys 


201 


1 






5 










10 










15 




203 


gaa 


ttc 


aac 


ate 


teg 


cag 


cac 


caa 


tgc 


gtg 


aaa 


aaa 


caa 


tgt 


ccc 


gag 


204 


Glu 


Phe 


Asn 


lie 


Ser 


Gin 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


205 








20 










25 










30 






207 


gaa 


ttc 


aac 


ate 


teg 


cag 


cac 


caa 


tgc 


gtg 


aaa 


aaa 


caa 


tgt 


ccc 


gag 


208 


Glu 


Phe 


Asn 


He 


Ser 


Gin 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


209 






35 










40 










45 








211 


aac 


tct 


ggc 


tgt 


ttc 


aga 


cac 


ttg 


gac 


gag 


aga 


gag 


gag 


tgt 


aaa 


tgt 


212 


Asn 


Ser 


Gly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 


213 




50 










55 










60 










215 


ctg 


ctg 


aac 


tac 


aaa 


cag 


gag 


ggc 


gac 


aag 


tgc 


gtg 


gag 


aac 


ccc 


aac 


216 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 


217 


65 










70 










75 










80 


219 


ccg 


acc 


tgt 


aac 


gag 


aac 


aac 


ggc 


ggc 


tgt 


gac 


gca 


gac 


gec 


aaa 


tgc 


220 


Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 


221 










85 










90 










95 




223 


acc 


gag 


gag 


gac 


teg 


ggc 


age 


aac 


ggc 


aag 


aaa 


ate 


acg 


tgt 


gag 


tgt 


224 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


He 


Thr 


Cys 


Glu 


Cys 


225 








100 










105 










110 






227 


acc 


aaa 


ccc 


gac 


teg 


tac 


ccg 


ctg 


ttc 


gac 


ggc 


ate 


ttc 


tgc 


age 


taa 


228 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro 


Leu 


Phe 


Asp 


Gly 


He 


Phe 


Cys 


Ser 




229 






115 










120 










125 








231 


taa 
































235 


<210> SEQ ID NO 


8 
























236 


<211> LENGTH: 127 
























237 


<212> TYPE: 


PRT 


























238 


<213> ORGANISM: 


Plasmodium falciparum 














240 


<400> SEQUENCE: 


8 
























241 


Met 


Lys 


Ala 


Leu 


Leu 


Phe 


Leu 


Phe 


Ser 


Phe 


He 


Phe 


Phe 


Val 


Thr 


Lys 


242 


1 






5 










10 










15 




243 


Glu 


Phe 


Asn 


He 


Ser 


Gin 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


244 








20 










25 










30 






245 


Glu 


Phe 


Asn 


He 


Ser 


Gin 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


246 






35 










40 










45 








247 


Asn 


Ser 


Gly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 


248 




50 










55 










60 










249 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 


250 


65 










70 










75 










80 


251 


Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 


252 










85 










90 










95 




253 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


He 


Thr 


Cys 


Glu 


Cys 


254 








100 










105 










110 







48 



96 



144 



192 



240 



288 



336 



384 



387 
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255 
256 
260 
261 
262 
263 
265 
266 
267 
269 
270 
271 
272 
274 
275 
276 
278 
279 
280 
282 
283 
284 
286 
287 
288 
290 
291 
292 
294 
295 
296 
299 
300 
301 
302 
304 
305 
306 
307 
308 
309 
310 
311 
312 
313 
314 
315 
316 
317 



Thr Lys Pro Asp Ser Tyr Pro Leu 
115 120 
SEQ ID NO: 9 
LENGTH: 330 
TYPE: DNA 



Phe Asp Gly lie Phe Cys Ser 
125 



<210> 
<211> 
<212> 
<213> ORGANISM 
FEATURE: 
NAME/KEY 
LOCATION 



Plasmodium falciparum 



<220> 
<221> 
<222> 



<400> SEQUENCE 



CDS 

(1). 

9 



•(330) 



gaa aca gaa agt tat aag cag ctt 

Glu Thr Glu Ser Tyr Lys Gin Leu 

1 5 

ate teg cag cac caa tgc gtg aaa 

He Ser Gin His Gin Cys Val Lys 
20 

tgt ttc aga cac ttg gac gag aga 

Cys Phe Arg His Leu Asp Glu Arg 

35 40 

tac aaa cag gag ggc gac aag tgc 

Tyr Lys Gin Glu Gly Asp Lys Cys 



50 



55 



aac gag aac aac ggc ggc tgt gac 
Asn Glu Asn Asn Gly Gly Cys Asp 

65 70 
gac teg ggc age aac ggc aag aaa 
Asp Ser Gly Ser Asn Gly Lys Lys 
85 

gac teg tac ccg ctg ttc gac ggc 
Asp Ser Tyr Pro Leu Phe Asp Gly 
100 

<210> SEQ ID NO: 10 
<211> LENGTH: 108 
<212> TYPE: PRT 

<213> ORGANISM: Plasmodium falc 

<400> SEQUENCE: 10 

Glu Thr Glu Ser Tyr Lys Gin Leu 

1 5 
He Ser Gin His Gin Cys Val Lys 
20 

Cys Phe Arg His Leu Asp Glu Arg 



gta gee 
Val Ala 
10 

aaa caa 
Lys Gin 

25 

gag gag 
Glu Glu 

gtg gag 
Val Glu 

gca gac 
Ala Asp 

ate acg 
He Thr 
90 

ate ttc 
He Phe 
105 



aac gtg 
Asn Val 

tgt ccc 
Cys Pro 

tgt aaa 
Cys Lys 

aac ccc 
Asn .Pro 
60 

gee aaa 
Ala Lys 

75 
tgt gag 
Cys Glu 

tgc age 
Cys Ser 



gac gaa ttc aac 
Asp Glu Phe Asn 
15 

gag aac tct ggc 
Glu Asn Ser Gly 

30 

tgt ctg ctg aac 
Cys Leu Leu Asn 
45 

aac ccg acc tgt 
Asn Pro Thr Cys 

tgc acc gag gag 
Cys Thr Glu Glu 
80 

tgt acc aaa ccc 
Cys Thr Lys Pro 
95 

taa taa 
110 



35 



40 



Tyr Lys. Gin Glu Gly Asp Lys Cys 

50 55 
Asn Glu Asn Asn Gly Gly Cys Asp 

65 70 
Asp Ser Gly Ser Asn Gly Lys Lys 
85 

Asp Ser Tyr Pro Leu Phe Asp Gly 



lparum 

Val Ala 
10 

Lys Gin 

25 
Glu Glu 

Val Glu 

Ala Asp 

He Thr 
90 

He Phe 



Asn Val Asp Glu Phe Asn 
15 

Cys Pro Glu Asn Ser Gly 
30 

Cys Lys Cys Leu Leu Asn 
45 

Asn Pro Asn Pro Thr Cys 
60 

Ala Lys Cys Thr Glu Glu 
75 80 
Cys Glu Cys Thr Lys Pro 
95 

Cys Ser 



48 



96 



144 



192 



240 



288 



330 
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